首页> 外文OA文献 >A near-threshold RISC-V core with DSP extensions for scalable IoT Endpoint Devices
【2h】

A near-threshold RISC-V core with DSP extensions for scalable IoT Endpoint Devices

机译:具有Dsp扩展的接近门限的RIsC-V内核,用于可扩展的IoT   终端设备

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Endpoint devices for Internet-of-Things not only need to work under extremelytight power envelope of a few milliwatts, but also need to be flexible in theircomputing capabilities, from a few kOPS to GOPS. Near-threshold(NT) operationcan achieve higher energy efficiency, and the performance scalability can begained through parallelism. In this paper we describe the design of anopen-source RISC-V processor core specifically designed for NT operation intightly coupled multi-core clusters. We introduce instruction-extensions andmicroarchitectural optimizations to increase the computational density and tominimize the pressure towards the shared memory hierarchy. For typicaldata-intensive sensor processing workloads the proposed core is on average 3.5xfaster and 3.2x more energy-efficient, thanks to a smart L0 buffer to reducecache access contentions and support for compressed instructions. SIMDextensions, such as dot-products, and a built-in L0 storage further reduce theshared memory accesses by 8x reducing contentions by 3.2x. With fourNT-optimized cores, the cluster is operational from 0.6V to 1.2V achieving apeak efficiency of 67MOPS/mW in a low-cost 65nm bulk CMOS technology. In a lowpower 28nm FDSOI process a peak efficiency of 193MOPS/mW(40MHz, 1mW) can beachieved.
机译:物联网的端点设备不仅需要在几毫瓦的极其紧凑的功率范围内工作,而且还需要灵活的计算能力(从几kOPS到GOPS)。近阈值(NT)操作可以实现更高的能源效率,并且可以通过并行性获得性能可伸缩性。在本文中,我们描述了专门为NT操作紧密耦合的多核群集而设计的开源RISC-V处理器核的设计。我们介绍了指令扩展和微体系结构优化,以增加计算密度并最小化对共享内存层次结构的压力。对于典型的数据密集型传感器处理工作负载,由于采用了智能L0缓冲区以减少缓存访问争用并支持压缩指令,因此所建议的内核平均速度提高3.5倍,能源效率提高3.2倍。 SIMD扩展(例如点乘积)和内置的L0存储进一步将共享内存访问量减少了8倍,将争用减少了3.2倍。借助四个经过NT优化的内核,该集群可在0.6V至1.2V的电压范围内工作,并采用低成本65nm批量CMOS技术实现了67MOPS / mW的峰值效率。在低功耗28nm FDSOI工艺中,可以实现193MOPS / mW(40MHz,1mW)的峰值效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号